PASS2: A Database of Structure-Based Sequence Alignments of Protein Structural Domain Superfamilies

Authors

  • Karuppiah Kanagarajadurai
  • Singaravelu Kalaimathy
  • Paramasivam Nagarajan
  • Ramanathan Sowdhamini
Abstract

Sequence alignments guided by structural features are particularly suited to distant relationships, and they permit better sampling of protein sequence space. Reliable sequence alignments could be useful in evolutionary biology and in defining structure-function relationships for protein superfamilies. The PASS2 database presents structure-based alignments of protein domains related at the superfamily level. The number of new superfamilies increased by 47% compared to the previous (2003) version of PASS2, which shows the crucial need for updating the database. In the current release of PASS2, single-member, two-member and multi-member superfamilies go through different pipelines in the structural alignment protocol. We have also introduced alignment assessment measures, such as mean RMSD and the percentage of secondary structural equivalences, which should be effective in producing better alignments with minimal intervention. New features such as sequence clustering, length-conserved blocks, secondary structural equivalence and outlier recognition are also provided. We suggest that structurally deviant superfamily members could be removed as outliers so that extremely distant relationships do not obscure the alignment. We thus report a nearly automated, updated version of the superfamily alignment database, consisting of 1776 superfamilies and 9536 protein domains, that is in direct correspondence with the SCOP (1.73) database. The database of alignments can be accessed at http://caps.ncbs.res.in/pass2/pass2.html
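
The assessment measures named in the abstract (mean RMSD, percentage of secondary structural equivalences, outlier recognition) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the function names, the per-member averaging of pairwise RMSD values, the column-wise definition of secondary structural equivalence, and the 4.0 Å cutoff are hypothetical and are not taken from the PASS2 pipeline itself.

```python
from statistics import mean


def mean_rmsd_per_member(pairwise_rmsd):
    """Average each member's RMSD against every other member.

    pairwise_rmsd maps a (member_a, member_b) tuple to an RMSD in angstroms.
    The per-member averaging is an assumed definition of "mean RMSD".
    """
    values = {}
    for (a, b), rmsd in pairwise_rmsd.items():
        values.setdefault(a, []).append(rmsd)
        values.setdefault(b, []).append(rmsd)
    return {member: mean(v) for member, v in values.items()}


def sse_equivalence_percent(ss_a, ss_b, gap="-"):
    """Percentage of gap-free aligned columns with identical secondary
    structure labels (for example H, E, C) in two aligned rows."""
    if len(ss_a) != len(ss_b):
        raise ValueError("aligned rows must have equal length")
    aligned = [(a, b) for a, b in zip(ss_a, ss_b) if a != gap and b != gap]
    if not aligned:
        return 0.0
    same = sum(1 for a, b in aligned if a == b)
    return 100.0 * same / len(aligned)


def flag_outliers(member_rmsd, cutoff):
    """Members whose mean RMSD exceeds the (hypothetical) cutoff."""
    return sorted(m for m, v in member_rmsd.items() if v > cutoff)


# Hypothetical three-member superfamily; the RMSD values are made up.
pairwise = {("d1", "d2"): 1.0, ("d1", "d3"): 4.0, ("d2", "d3"): 5.0}
per_member = mean_rmsd_per_member(pairwise)
outliers = flag_outliers(per_member, cutoff=4.0)
sse_pct = sse_equivalence_percent("HHEE-C", "HHEC-C")
```

In this toy input, `d3` is the structurally deviant member, so removing it before realignment would mirror the outlier handling the abstract suggests.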


Similar resources

PASS2 version 4: An update to the database of structure-based sequence alignments of structural domain superfamilies

Accurate structure-based sequence alignments of distantly related proteins are crucial in gaining insight into protein domains that belong to a superfamily. The PASS2 database provides alignments of proteins that are related at the superfamily level and characterized by low sequence identity. We thus report an automated, updated version of the superfamily alignment database known as PASS2.4, consis...


LenVarDB: database of length-variant protein domains

Protein domains are functionally and structurally independent modules, which add to the functional variety of proteins. This array of functional diversity has been enabled by evolutionary changes, such as amino acid substitutions or insertions or deletions, occurring in these protein domains. Length variations (indels) can introduce changes at structural, functional and interaction levels. LenV...


Rebelling for a Reason: Protein Structural “Outliers”

Analysis of structural variation in domain superfamilies can reveal constraints in protein evolution, which aids protein structure prediction and classification. Structure-based sequence alignment of distantly related proteins, organized in the PASS2 database, provides clues about structurally conserved regions among different functional families. Some superfamily members show large structural diffe...


S4: structure-based sequence alignments of SCOP superfamilies

S4 is an automatically generated database of multiple structure-based sequence alignments of protein superfamilies in the SCOP database. All structural domains that do not share more than 40% sequence identity as defined by the ASTRAL compendium of protein structures are included. The alignments are constructed using pairwise structural alignments to generate residue equivalences that are then ...


FunTree: a resource for exploring the functional evolution of structurally defined enzyme superfamilies

FunTree is a new resource that brings together sequence, structure, phylogenetic, chemical and mechanistic information for structurally defined enzyme superfamilies. Gathering together this range of data into a single resource allows the investigation of how novel enzyme functions have evolved within a structurally defined superfamily as well as providing a means to analyse trends across many s...



Journal:
  • IJKDB

Volume 2

Publication date: 2011